SODA: A large-scale open site object detection dataset for deep learning in construction
نویسندگان
چکیده
Computer vision-based deep learning object detection algorithms have been developed sufficiently powerful to support the ability recognize various objects. Although there are currently general datasets for detection, is still a lack of large-scale, open-source dataset construction industry, which limits developments as they tend be data-hungry. Therefore, this paper develops new large-scale image specifically collected and annotated site, called Site Object Detection dAtaset (SODA), contains 15 kinds classes categorized by workers, materials, machines, layout. Firstly, more than 20,000 images were from multiple sites in different site conditions, weather phases, covered angles perspectives. After careful screening processing, 19,846 including 286,201 objects then obtained with labels accordance predefined categories. Statistical analysis shows that advantageous terms diversity volume. Further evaluation two widely-adopted based on (YOLO v3/ YOLO v4) also illustrates feasibility typical scenarios, achieving maximum mAP 81.47%. In manner, research contributes development learning-based methods industry sets up performance benchmark further corresponding area.
منابع مشابه
LSUN: Construction of a Large-scale Image Dataset using Deep Learning with Humans in the Loop
The state-of-the-art visual recognition algorithms are all data-hungry, requiring a huge amount of labeled image data to optimize millions of parameters. While there has been remarkable progress in algorithm and system design, the labeled datasets used by these models are quickly becoming outdated in terms of size. To overcome the bottleneck of human labeling speed during dataset construction, ...
متن کاملSupplement to: iLab20M: A large-scale controlled object dataset to investigate deep learning
Experiment I: category prediction In this experiment, we randomly select 2K samples from 7 categories (boat, bus, f1car, tank, train, ufo, and van) and feed them to a pretrained CNN model, specifically Alexnet. Having fc7 and pool5 representations of selected samples ready, we use the t-SNE algorithm to reduce their dimensionality to 2D. In addition, 20K images are randomly selected from all 7 ...
متن کاملLarge-Scale Deep Learning on the YFCC100M Dataset
We present a work-in-progress snapshot of learning with a 15 billion parameter deep learning network on HPC architectures applied to the largest publicly available natural image and video dataset released to-date. Recent advancements in unsupervised deep neural networks suggest that scaling up such networks in both model and training dataset size can yield significant improvements in the learni...
متن کاملDOTA: A Large-scale Dataset for Object Detection in Aerial Images
Object detection is an important and challenging problem in computer vision. Although the past decade has witnessed major advances in object detection in natural scenes, such successes have been slow to aerial imagery, not only because of the huge variation in the scale, orientation and shape of the object instances on the earth’s surface, but also due to the scarcity of wellannotated datasets ...
متن کاملLearning with Parsimony for Large Scale Object Detection and Discovery
Learning with Parsimony for Large Scale Object Detection and Discovery
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Automation in Construction
سال: 2022
ISSN: ['1872-7891', '0926-5805']
DOI: https://doi.org/10.1016/j.autcon.2022.104499